3574 results found.
Vectors
Word Vectors,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons Attribution-Share-Alike License 3.0
Size:
2 million word vectors OtherProduction Status:
Existing-used
Use:
Document Classification, Text categorisation
-
Paper title:The Connection between the Text and Images of News Articles: New Insights for Multimedia Analysis
-
Paper track:Multimodality/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Martha Larson | English word vectors | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
News Multimedia Analysis Data
Size:
1000 articles Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
-
Paper title:The Connection between the Text and Images of News Articles: New Insights for Multimedia Analysis
-
Paper track:Multimodality/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Martha Larson | Oostdijk et al. Flood News Multimedia Analysis Data | /N |
Documentation:
None
Written
Lexicon,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None Production Status:
Newly created-in progress
Use:
-
Paper title:A Broad-Coverage Deep Semantic Lexicon for Verbs
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Choh Man Teng | COLLIE-V | /N |
Documentation:
Yes, available at the URL above
Transcribed Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
1616 utterances OtherProduction Status:
Newly created-finished
Use:
Dialogue
-
Paper title:Modeling Dialogue in Conversational Cognitive Health Screening Interviews
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Shahla Farzana | DementiaBank DA Corpus | /N |
Documentation:
Yes
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Not Available
License:
Size:
10k sentences Production Status:
Newly created-in progress
Use:
Detection of reading absorption expressions
-
Paper title:Detection of Reading Absorption in User-Generated Book Reviews: Resources Creation and Evaluation
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Piroska Lendvai | Social Reading Absorption Corpus | /N |
Documentation:
None
Lexicon,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None Production Status:
Newly created-finished
Use:
Text Mining
-
Paper title:Annotating and Analyzing Biased Sentences in News Articles using Crowdsourcing
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Sora Lim | Biased Sentences in News Articles | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English German
Availability:
Freely Available
License:
Size:
1000 GByte Production Status:
Existing-used
Use:
Corpus Creation/Annotation
-
Paper title:A Corpus of German Reddit Exchanges (GeRedE)
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Andreas Blombach | Pushshift Reddit submissions and comments | /N |
Documentation:
Some English documentation at https://www.reddit.com/r/pushshift/comments/bcxguf/new_to_pushshift_read_this_faq/
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
2.4 GByte Production Status:
Newly created-finished
Use:
Machine Learning
-
Paper title:Correcting the Autocorrect: Context-Aware Typographical Error Correction via Training Data Augmentation
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Kshitij Shah | Artificial Typo Corpora | /N |
Documentation:
Yes, English
Speech
Corpus,
Language Type:
Bilingual
Languages:
Egyptian Arabic English
Availability:
License:
Size:
12 hours Production Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
-
Paper title:ArzEn: A Speech Corpus for Code-switched Egyptian Arabic-English
-
Paper track:Speech/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Injy Hamed | ArzEn | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
ELRA
Size:
20 hours Production Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
-
Paper title:AccentDB: A Database of Non-Native English Accents to Assist Neural Speech Recognition
-
Paper track:Speech/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Afroz Ahamad | AccentDB: A Database of Non-Native English Accents to Assist Neural Speech Recognition | /N |
Documentation:
https://accentdb.github.io




